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Abstract. It is known (Hofmann-Crcdner and Stolz (2008)) that the conver- 

^vj gcncc of the mean empirical spectral distribution of a sample covariance matrix 

y—( W„ = 1/n Y„Y^ to the Marcenko-Pastur law remains unaffected if the rows and 

^^ columns of Y„ exhibit some dependence, where only the growth of the number of 

CN dependent entries, but not the joint distribution of dependent entries needs to be 

5h controlled. In this paper we show that the well-known CLT for traces of powers 

^^ of W„ also extends to the dependent case. 

o 

(N 

1. Introduction 

Ph Sample covariance matrices are fundamental objects in multivariate statistical in- 

'^ ference that have found their way into random matrix theory. There, one usually 

'-^ studies them in the simplified form 

^ W„ = -Y„Y^, 

where 

> Y„ = {an{p,q))p=l,...,s{n);q=l,...,n 

is a rectangular array of independent real random variables, which are typically 



1^ 

00 

(T^ assumed to be centered, have unit variance, and satisfy uniform moment bounds 

sup max E(|a„(p, g)|'^) < oo (1.1) 

Cn „ p=l,...,s{n), 

Q q=l,...,n 

for all A; G N. 

If Ai, . . . , Xs(n) £ I^ denote the eigenvalues of W„ (with multiplicities), define their 

empirical measure as 

s(n) 



cn 

o 

(N 



X 



s(n) 



a L„(W„):=-^}_^5a„ 



where Sx. denotes the Dirac measure supported in Xj. In a random matrix theory 
context, one is typically interested in the asymptotics of L„(W„) as n — ?> oo and 

-^^-^y, 0<y<oo. (1.2) 

n 
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In statistical terms, if the n columns of Y„ each encode an observation of size s(n), 
this means that the number of variables under consideration is of the same order 
of magnitude as the sample size. This is at variance with classical multivariate 
analysis, but makes perfect sense in the context of modern data mining techniques 
(see the discussion of this point in the introduction of [3J). 

It is well known that in this regime the random probability measure L„(W„) con- 
verges almost surely weakly to the Marcenko-Pastur distribution with parameter 
y. This result may be interpreted as a law of large numbers. To understand the 
fluctuations about this limit, the family (tr(W„)V has been studied in the large 
n limit and found to be asymptotically Gaussian by Jonsson [B] , Cabanal-Duvillard 
[2|, and Kusalik, Mingo, and Speicher [7|, among others. See also Johansson [S] for 
the corresponding result for Wigner matrices. Particular interest has been devoted 
to "diagonalizing" the fluctuations in the sense of finding functions fi, . . . , fi such 
that, as ra — )■ cxD, (tr(/i(Wn)), . . . , tr(/;(W„))) tends to a family of independent 
Gaussians. By the cited work of Cabanal-Duvillard, the fi may be chosen as shifted 
Chebyshev polynomials of the first kind. (As observed by Johansson, in the Wigner 
case one may take, up to rescaling, the usual Chebyshev polynomials of the first 
kind.) 

These fluctuation results have been proven under the assumption that the en- 
tries of Y„ are independent. One may ask how many violations of this inde- 
pendence assumption may be tolerated for the limit theorems to remain intact. 
In |1], Hofmann-Credner and one of the authors proved that the convergence of 
Ln(Wn) to the Marcenko-Pastur distribution remains valid if one allows the entries 
of Y„ = {an{p,q))p=i^...^s{n);g=i,...,n to depend on each other in arbitrary ways, both 
within a column, and across different columns, as long as for any entry a„(p, q) the 
number of entries that depend on it does not grow too fast as n — )■ oo. This was 
made precise using a formalism that had been introduced by Schenker and Schulz- 
Baldes in [S] to study Wigner matrices with dependent entries. 

In the present paper, we extend the study of sample covariance matrices with de- 
pendent data to the level of fluctuations, again following the lead of Schenker and 
Schulz-Baldes, who in ^ undertook an analogous investigation in the case of Wigner 
matrices. We start in Section [2] with a precise formulation of the conditions we have 
to impose on the number of dependent entries. We then state in Theorem 2.3 our 
main result concerning the fluctuations, including the convergence to a Gaussian 
family and the diagonalization by shifted Chebyshev polynomials. In Section [3} 
some important facts about cumulants are summarized. We resort to those mainly 



in Section |4] which contains the proof of the first part of Theorem Z3 and some steps 
towards the second part. Sections [5| [6] and [7] complete the proof. Finally, in Section 
[8] we present an alternative approach to Theorem Z3 which relies on the results for 
Wigner matrices given in [9]. 

2. Set-up and main results 

In view of a physics application that will be discussed in a companion paper, we will 
state and prove our results in a slightly more general framework. We will allow the 
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entries of Y„ to be complex random variables, thus defining W„ := -Y„Y* , where 
the star means conjugate transpose, and let the size of Y„ be s{n) x t(n), where we 
assume that there exist /€,//€ (0, oo) such that 

lim = K and lim = /i. (2-1) 

n— >cxD n n— >oo fl 

We assume the entries an{p,q) {p = l,...,s(n), q = l,...,t(n)) to be centered 
of variance o"^ > 0, and to satisfy (1.1), where the maximum now runs over p = 



l,...,s(n), q = l,...,t(n). 

For a positive integer n write [n] for {1, . . . ,n}. Consider an equivalence relation ~„ 
on [s{n)] X [t{n)] and assume that the random matrix Y„ = (a„(p, q))p=i,...,s{n),q=i,...,t{n) 
has a dependence structure that is controlled by ~„ in the following way: The ran- 
dom variables a„(pi, gi), . . . , dnipj, qj) are independent whenever (pi, gi), . . . , (pj, qj) 
belong to j distinct equivalence classes of the relation ~„. On the other hand, there 
is no assumption on the joint distribution of the matrix entries that correspond to 
equivalent index pairs. To state our assumptions on the growth of the equivalence 
classes of ~„ we introduce the following quantities: 

/3oH:=#{(p,p',g) e [s{n)fx [tin)] : (p, g) ~„ (p', g) k p ^ p'] 

V # { (p, g, q) e [s{n)] X [t{n)f : (p, g) ~„ (p, g') & g ^ g'} , 

/3i(n) := max #{(p',g,g') G [s{n)\ x [t{n)f : (p,g) ~„ (p',g')} 

pe[s(n)] 

V niax #{(p,p',g') G [^(ra)]^ x [t{n)] : (p, g) ~„ (p',g')}, 

96[t(")] 

/32(n) := max # {(p', g') G [s(n)] x [t{n)] : (p, g) ~„ (p', g')} , 

(p,g)G[Kn)]x[t(n)] 

I3^{n) := max #{g' G [t{n)] : (p,g) ~„ (p',g')} 

pye[s{n)],ge[t(n)] 

pe[s{n)],g,g'G[t(n)] 

The following theorem is the starting point for the present paper: 

Theorem 2.1 ([4J). ///3oH = ^(n^-^), /3i(n) = ^(ri^-^) and (3 3(71) = 0{n') for all 
sufficiently small e > 0, then, as n ^ 00, t/ie mean empirical measure E(L„( W„)) 
converges weakly to a probability measure supported on a compact interval in M"*" 
with k-th moment equal to 






k\ I k \ j_i fc_j_|_i 



k ^-^ \i J \i — 1 



—y 

If fl = 1, this limit is the Marcenko-Pastur distribution with parameter k. 



Proof. This is [U Thm. 4.1], adapted to the present set-up by applying it to Y* in 
the place of Y„. This is reflected in the fact that /3i(n) is symmetrized, while the 
quantity that appears in (MPl) of [3] is not. Note furthermore that the empirical 
measure was defined slightly different in ^, and the assumptions were given by 
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(3o{n) = o{n^), /3i(n) = o(n^) and /33{n) = 0{1). However, the same result holds in 
our situation by essentially the same proof. D 

Remark 2.2. This result should be compared with the models that were studied in 
PP, where Y„ may be chosen, e.g., as a Toeplitz matrix. Clearly, the conditions of 
Theorem 2.1 on the /3fc(ra) are violated in this case, and in fact this class of models 
gives rise to new limit distributions as n — )■ oo. 

For the study of fluctuations we will need more restrictive conditions on the growth 
of the equivalence classes, which will be stated in terms of Po{n) and /32(^)- Note 
that (3i{n) < {s{n) V t{n))(32{n) and /33(n) < /32{n). So the assumptions of the 
following theorem, which is the main theorem of the present paper, jointly imply 
the conclusions of Theorem 2J^ To state our main result, we need to introduce a 
family of orthogonal polynomials that will be discussed more extensively in Section 
[T] below. Denote by T^ {k G N) the monic Chebyshev polynomials of the first kind, 
with orthogonality measure J^ - For any y G (0, oo). A; G N and {^/y — 1)^ < x < 

{^/y + 1)^ define 



n(.) := v?n (^^^) 



and 



X 



Tl{x,a):=a"^Tl,^ 



The following theorem is the main result of this paper. 

Theorem 2.3. 

(a) If P2{'n) = O {n'^) for all e > 0, then, as n —^ oo, the random vector 

(tr(W^^),...,tr(W^O) 
(where j G N, fci, . . . , /cj G Nj converges in distribution to a jointly Gaussian 
vector (whose components are not necessarily nondegenerate) . 

(b) //, in addition, Po(ji)l32{n)^ = oiji^) for all rj > 0, then there holds 

Cov(tr(r^^( Wn, (r)),tT{Tl{ W„,, a))) = <5,fcf/„(2j) + o(l), 
where y = k/jj, and Un{2j) is given in (6.8) in Sectionl^ below. 

3. Some facts about cumulants 

Let Xi, . . . ,Xj be random variables defined on a common probability space with 
moments of all orders. Then, the characteristic function 



<^Xu-,x,{ti, . . . , tj) = E exp ^ itiXi 



.1=1 

is infinitely differentiable inti, . . . ,tj. The joint cumulant Cj{Xi, . . . ,Xj) is defined 
as 

Cj{Xi,...,Xj) = i ^^^——^\og(px^,...,x,{ti,...,tj) 

In particular, we have 

Ci(XO = EXi, C2(Xi,Xi) = VarXi, C2(Xi,X2) = Cov(Xi,X2). 
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In general, the joint cumulant can be expressed in terms of the mixed moments of 
Xi, . . . ,Xj. Specifically, we have the moment- cumulant formula 

c,(x„...,x,)= E (-i)*'^"'(#^-i)'ri^f n ^^)' (3.1) 

where V{j) is the set of all partitions of {1, . . . , j}, and for n E V{j), #7r is the 
number of blocks of vr, which are denoted by i?i(7r), . . . , _B#^(7r) (cf. [10, 11.12]). 
The formula above immediately implies that the joint cumulant is symmetric and 
multilinear. The following two lemmata will be frequently used in the subsequent 
sections. 

Lemma 3.1. // there is a partition of {l,...,j} into two nonempty subsets M 
and N such that the families {Xi,l G M} and {Xi,l G N} are independent, then 
C,{X,,...,Xj) = 0. 

Proof. As a consequence of the symmetry, we may assume without loss of generality 
that M = {1, . . . , /} and N = {I + 1, . . . ,j} for some / = 1, . . . , j — 1. Due to the in- 
dependence, we have ipxu-,Xj(ti, . . . ,tj) = <^Xi,...,Xi(ti, . . . ,ti)ipxi+^,...,x,(ti+i, . . .,tj). 
It thus follows that 

log^x,,...,x,{ti,...,tj) 



dti- ■ ■ dtj 



log (^Xi,...,Xi {ti,...,ti) + — log ^Xi+u...,X, {ti+i, ...,tj)= 0, 



dti---dtj ^^-i'-'-'^ ^'•••' '^ dti---dtj 

implying C,(Xi,...,X,) = 0. D 

Lemma 3.2. The vector (Xi, . . . , Xj) has a Gaussian joint distribution if and only 
if Ci(Xjj, . . . ,XjJ = for any I > 3 and ii, . . . ,ii G {1, . . . , j}. The distribution is 
non- degenerate if and only if C2{Xi, Xi) > for any i G {1, . . . ,j}. 

Proof. It is well-known that a Gaussian vector (Xi, . . . ,Xj) with mean vector /i and 
covariance matrix S satisfies 

^Xu...,x,{ti, ...,tj)= exp lit^fi - ^t^^tj , 
where t^ = (ti, . . . , tj). Thus, any partial derivative of 

logv5xi,...,x,(ti, ...,tj) = it^n - ^f^^t 

of order greater or equal to 3 vanishes. On the other hand, the Gaussian distribution 
is uniquely determined by its moments and consequently, by its cumulants. D 

4. The asymptotic vanishing of higher cumulants 

To establish a Gaussian limit, we will show that the joint cumulants of order j > 3 of 
(tr(Wj!j))fcgr^ asymptotically vanish as n — )■ oo. As a first step towards this goal, we 
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expand each trace tr(W„) in a way that makes it possible to exploit the information 
that is available about the dependence structure among the entries of Y„. We obtain 

tr(W'=) = ^Y1 <PMP2) ■■■■■ a{P2k^MP^), 
p 

where the sum is over all families P = (P/)/=i,...,2A: of pairs Pi = {pi, qi) that satisfy 

• Pi e [s] X [t] for all/ = 1, . . . , 2A; and 

• 'P21 = P21+1 and q2i-i = q2i for all I = 1, . . . ,k, where 2A; + 1 is cyclically 
identified with 1. 

Here we have dropped explicit reference to the dependence of s{n),t{n) on n. By 
the multilinearity of cumulants, this expansion implies that for fci, . . . , fcj G N, k : = 

ki + . . . + kj one has 

Cj(tr(W^'i),...,tr(W'^0) 

. / ki kj \ 

= ^kJ2^A n«(^i-2/-i)a(Pi,2/),...,n«(^.-.2/-i)a(P,,20 , (4.1) 
p y=i 1=1 J 

where the sum is now over all doubly indexed families Pi^i (i = 1, . . . , j; / = 1, . . . , 2ki) 
satisfying 

(Bl) Pu e [s] X [t] for alH = 1, . . . , j; 1 = 1,..., 2ki and 

(B2) pi^2i = Pi,2i+i and qi,2i-i = qi,2i for all i = l,...,j; / = 1, . . . , fci, where 
2ki + 1 is cyclically identified with 1. 

Write 

M:={{i,l) |z = l,...,j; 1 = 1,. ..,2k^}. 
This means that the sum in (4.1[) is over all maps P : Ai ^ [s] x [t] such that (B2) 



is satisfied. For fixed i, in accordance with [9], we will sometimes refer to the family 

{i, 1), (z, 2), . . . , (z, 2ki) as the i-circle of A^. 

Now, on [s] X [t] one has the equivalence relation ~„ that governs the dependence 
structure of the random matrix Y„ = {an{p,q)). So any map P : Ai ^ [s] x [t] 
induces an equivalence relation ~p on Ai via 

{t,l)r^p{7f,n :^^ P.^i^nPv^v. (4.2) 



In the sum in (4.1 ), we will group the maps P together according to which partition 
they induce on Ai. Write V{Ai) for the set of all partitions of Ai, and for any 
71 G 'P(A^) denote by M„(7r) the set of all P : A^ — )■ [s] x [t] such that the equivalence 
classes of ~p form the partition vr. Then (4.1) reads 

C,(tr(W'=0,...,tr(W^'0) (4.3) 

= ^k Y. E cAl[aiP^,2l-MPl,2l),...,l[a{P,,2l-MP,,2l] 

If for any choice of nonempty disjoint subsets M, A^ C [j], M U A^ = [j], there are 
i e M, I e [2ki], i' e N and /' G [2ki>] such that (i, /) ~^ {i',l'), we say that tt is 
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connected and write tt G 'P^(A^). If vr is not connected, then we can find a partition 
of [j] into nonempty subsets M, N such that 

n«(^M,2Z-i)a(^/.,2z) : /ieM 
and 



are independent. By Lemma 3.1 above, this imphes that for any non-connected tt 



the corresponding summand in (4.3) vanishes. We may thus restrict the sum to 
connected partitions. 

We call {i, I) E Ai a. connector of tt G V^lAi) if there exist i' ^ i and /' G [2ki/] such 
that {i,l) ~,r i'i'J')- It is a simple connector ii, in addition, {i,l) 7^^ {'i-J') for all 

rG[2fc,], r^/. 

Lemma 4.1. Let vr G P'^(A^), j > 3, and assume that (32{n) = O (n^) for all e > 0. 
Then, as n —)■ 00, 



PeM„(7r) 

•= ^ 5Z ^AW <Pl,2l-lHPl,2l), • • • , n «(^,,2i-l)«(^i,2/) 
PeM„(7r) y=l i=l 

= 0(1). 

Proof. If TT has a singleton block {(«,/)}, then a{Pi^i) is a centered random vari- 
able that is independent of all other a{Pi'^ii). So the expectation of any product 
of matrix entries that contains the term a{Pi^i) vanishes. Hence, by the moment- 



cumulant formula (3.1), Cjiji) = 0. Consequently, we need only consider partitions 



7r with k = ki + . . . + kj blocks or less. 

Now we wish to find an upper bound for the cardinality of M„(7r). To this end, 
we set out to construct an arbitrary sequence P = {Pii)i=i,...j- i=i,...,2ki £ M„(7r), 
starting with Pu = {pu, qu). There are s ■ t possible choices for this pair. Coming 
to P12, qi2 is already determined by (B2). As to pi2, we have to consider two cases: 
(1, 1) and (1, 2) may or may not belong to the same block of vr. In the first case, we 
have at most (33 choices for pi2, in the second case at most s. Proceeding to (1,3), 
this time it is pis that is fixed by (B2), and for gi3 we have /S^ or t choices according 
to whether or not a new block of vr is reached. In this manner we proceed cyclically 
along {1} X [2ki] until we reach (1,2A; — 1), where the corresponding pair Pi,2fci is 
already fixed by (B2) and the cyclic identification of 2ki + 1 with 1. Since -Pi,2fci 
may or may not belong to a block that has been reached before, this observation 
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only reduces our upper bound by a factor which is 0{n^) for all e > 0. Below we 
will encounter a situation in which the bound is reduced more substantially. 
Returning to the present bounding exercise, as vr is a connected partition of A^, we 
have already fixed the pair Pi; = {pu,qu) for a connector (1,/) and thus imposed 
restrictions on the choice of the pair {pi^u, qi^v) for some i 7^ 1 and /' e [Iki]. In fact, 
there are at most (32 choices for this {pi,i' , qi,i') ■ Moving cyclically along {i} x [2ki] 
we proceed as above, then turning to the remaining circles {i'} x [2fcj/], all of which 
may be reached via connectors since vr was assumed to be connected. Since /^s < (32, 
we thus may bound 

# M„(7r) < s{n)t{n){s{n) V t{n))*''~^ (32{nf''~*^ = O {n*^+^+') Ve > 0. (4.4) 



By the moment-cumulant formula (3.1 ), Holder's inequality, and the uniform bound 
(1.1) one sees that 

sup sup \Cj{P)\ < 00, 

n PeM„(7r) 

and in view of the pref actor n~^ , this implies that for ix to contribute to the limit 
it is necessary that t^tt > A; — 1. So we have shown that it suffices to consider 
connected vr with exactly k 01 k — 1 blocks. 

We are now going to find further necessary conditions for such a tt to give a nonzero 
contribution to the limit. We have seen that partitions with a singleton block do 
not contribute to the sum. So, if #7r = k, ti must be a pair partition, in the sense 
that all blocks of vr consist of exactly two elements. If t^tt = k — 1, either vr has 
two 3-blocks and /c — 3 pairs, or one 4-block and k — 2 pairs. We claim that in all 
three cases vr has a simple connector. In fact, vr was assumed to be connected, and 
if it is a pair partition, then all connectors have to be simple. If vr has only blocks 
of size 2 or 3 and if (i, /), {i', I') with i j^ i' belong to the same block of vr, then one 
of {i,l) and (i', I') is simple. Finally, consider the case that vr has a single 4-block 
and is otherwise a pair partition. If there is a 2-block that connects two different 
circles, then it consists of simple connectors. Otherwise, for vr to be connected it 
is necessary that the 4-block connect all circles. Invoking now the assumption that 
j > 3, we see that at least two elements of the 4-block must be simple connectors. 

Now suppose (possibly after relabeling) that (1, 2ki) is a simple connector of it. 
If we construct P as above, starting with (1, 1) and proceeding along {1} x [2ki], 
{pi,2ki,qi,2ki) is fixed by (-B2) and the cyclic identification of 2ki + 1 with 1. This 



time, in contrast to the above argument that led to the bound (4.4), the fact that 
(1, 2ki) is a simple connector guarantees that it is in a block that has not yet been 
reached, and so our upper bound gets reduced by a factor of order n. In total, the 
block of (1, 2ki) contributes a factor of order [32{n), and we end up with the bound 

#M„(7r) < s{n)t{n){s{n)yt{n))*^-^(52{nf^-*^^^ = O (r2#"+") . 

In view of the prefactor n~'', partitions vr with ^tt = /c — 1 do not contribute to 
the limit. If t^tt = k, then vr is a connected pair partition, and since j > 3, there 
is a circle that is connected to two distinct circles by simple connectors. Without 
loss of generality, assume that this circle is {2} x [2k2] and that it is connected 
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to {1} X [2ki] and {3} x [Zks] via the 2-blocks {(2, 1), (1, 2ki)} and {(2, /*), (3, 1)}. 
Repeating the above counting exercise, one obtains that these blocks contribute a 
factor (32{n) rather than 0{n). So we arrive at #M„(7r) = O (n*'^""'^^^), yielding a 
vanishing contribution in the limit in view of the prefactor of order n~'^. D 

5. The covariances: Reduction of partition types 
The next step is to study the covariances, i.e., the case j = 2. The proof of Lemma 



|4.1| implies the asymptotic negligibility of the following types of partitions: 

• Connected partitions with a singleton block. 

• Connected partitions tt with ^vr < k — 2. 

• Connected partitions vr with ^vr = k — 1 and a simple connector. 

So it remains to consider two types of partitions: 

(PPl) Connected pair partitions. 

(PP2) Partitions vr with ^vr = k — 1, consisting oi k — 2 pairs and one 4-block. 
The 4-block contains two elements of each of the two circles {1} x [2ki] and 
{2} X [2^2], and all of them are connectors. Each of the pairs consists of 
two elements from the same circle. 

Following p] we denote the set of all partitions of type (PPl) or (PP2) by the slightly 
misleading symbol 'P'Pi2ki]u\2k2]^ even though partitions of type (PP2) are not pair 
partitions. 

Within 'P'P\2ki]u\2k2] ^^ ^^^ going to identify further types of partitions which give 
vanishing contribution to the limit. An auxiliary notion that will be useful in what 
follows is an interval on a circle {i} x [2ki]. We cyclically identify 2ki + 1 with 1 and 
set for/ ^Z' in [2ki]: 

X-lvl//'r -J^ ifr = / + l. 

Closed and half open intervals are defined in the obvious way. A partition vr G 
'P'P\2ki]u\2k2] ^^ called crossing if there are {i,h) ~7r (^,^2) and mi G ]li,l2[i, ^^2 G 
]/2, h[i such that one of the following holds: 

(Cross 1) {i^rrii) ~^ (^,^2) 

(Cross 2) {i,mi) and {i,m2) are connectors. 

This notion can be visualized as follows: Draw the circle [2ki\, and around it the 
circle [2^2] ■ Now connect any two equivalent points by an internal path. A partition 
is non-crossing if this can be achieved without lines crossing each other (Figure Qj). 

Lemma 5.1. Suppose that f32{n) = 0{n^) Ve: > and let vr G 'P'P\2ki]u\2k2] ^^ 
crossing. Then 

^#M„(7r) = o(l), 

where k = ki + k2- 
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Figure 1. fci = 2, ^2 = 3. The first picture shows a crossing par- 
tition satisfying (Cross 1), the second a crossing partition satisfying 
(Cross 2), and the third one a non-crossing partition. 



Proof. By relabeling if necessary, we may assume that for a suitable / G [2ki] we 
have that (1,1) ~7r (1,0 ^"^^ ^^^^ there exist mi G ]l,/[i, wi2 ^ ]^l[i such that 
(Cross 1) or (Cross 2) hold. We claim that 



[I, nil) is not equivalent under vr to any other point of {l}x ]l,/[i. 



(5.1) 



This is because otherwise, there is j G ]l,/[i, j 7^ mi such that {l,m,i) r^n (1, j)- 
By (PPl), (PP2), this implies that (1,^1) 7^,^ (1,'^2), hence (Cross2) must hold. 
So (l,?Tii) must be a connector, which implies that (1, j) and (l,mi) are contained 
in a 4-block. So (l,'m2) should belong to a 2-block which by (PP2) consists only 
of elements of the first circle. So (1,7712) is not a connector, contradicting (Cross2) 



and thus proving (5.1). 



We also claim that either 



n contains a 4-block 



or 



there is a simple connector (1, 777,3) with 7773 G ]/, l[i. 



(5.2) 



(5.3) 



Assume that (5.2) is not the case. If (Cross 2) holds, then ( |5.3 ) is satisfied with 



7773 = 7772. If (Cross 1) holds, then there is a connector (1,7773). It is simple because 
vr is a pair partition, and 1 7^ 7773 7^ /, because tt does not contain a 4-block by 
assumption. We may assume that 7773 G ]/, l[i, because otherwise we may resort to 
a cyclic relabeling that maps (1, /) to (1, 1), and hence (1, 1) to (1, 2ki — 1 + 2), and 
then consider I' := 2ki — / + 2 in the place of I. 



Suppose now that vr G VVl 



[2fci]U[2A:2 



is crossing and has properties (5.1) as well as 



(5.2) or ( |5.3 ). We are going to bound # M„(7r). First we choose Pn, P12, • • • , Pi,mi-i, 
having s{n)t{n) choices for Pn and at most s{n) V ^(77) or P^ln) < P2{n) choices for 
the other pairs according to whether or not a new block of it is reached. Then we 
make our choice for Pi^i (there are at most f32{n) ways to do so since (1,1) ~7r (1, 0) 
and proceed to lower second indices, Pi^i, Pi,z-i, . . . , Pi,mi+i- Now, Pi,mi is fixed by 



(B2). On the other hand, by (5.1), the block of (l,?77i) is reached for the first time. 
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SO a factor of s{n) V t{n) is "lost" , and 



If (^ holds, then #7r = A; - 1, and ^# M„(7r) = o(l). If Q holds, then we may 
argue as in the last paragraph that the block of the simple connector (1,7713) foils 
to contribute a factor of s{n) V ^(77), hence ^#M„(7r) = o(l). D 

Lemma 5.2. Suppose that f32{n) = 0{n'^) Ve: > and f3o{n) f32{n)'^ = 0(77^) V77 > 
0. Consider tc G 'P'P\2ki]u\2k2] ^'^^ assume that it contains a block of the form 
{(7,/), (7,/ + 1)} with 7 G {1,2},/ G [2A;j]. Remove this block from tt, t/iTts obtaining 
a partition tt' . Then we have that 

i^#M„(7r)<C^#M„(7r') + o(l), 

where k := ki + k2 as usual. 

Proof. Assume without loss that the block in question is {(l,2A;i — 1), (1, 2A;i)}. 
First, consider those elements of M„(7r) which satisfy pi^2ki~2 = Pii- By condition 
(B2), any such element can be constructed from an element of M„(7r') and a choice 
for gi,2fci-i = <li,2ki- Note that there are t(77) choices since {(l,2fci — 1), (l,2fci)} is 
a block of TT. 

Coming to those elements of M„(7r) with pi,2fci-2 7^ Pn, (B2) implies that pi^2ki-i 7^ 
Pi,2ki and gi,2fci-i = ?i,2fci- To bound the number of these elements, observe that 
there are in total at most f3o{n) choices for the pairs {pi,2ki-i,(li,2ki-i) and (pi,2fci5 (li,2ki] 
Assume now that tt has a simple connector (1, /), where we necessarily have that / ^ 



{2ki — 1, 2ki}. Arguing as in the proof of Lemma 5.1 , one sees that the block of (1, /) 
affords at most f32{n) choices. So the number of elements of M„(7r) with pi,2fci-2 7^ Pii 
and a simple connector can be upper bounded by {s{n)\/t{n))'^'^~'^ (]o{n) l32{nY^~'^'^~^^ . 
In view of (PPl), (PP2), in the present case we have ^tt = k, so that we obtain a 
bound of {s{n) y t{n)Y~'^o{n^) = o{n'^). 

If TT has no simple connector, then by (PPl), (PP2) it has a 4-block, and we get a 
contribution that is upper bounded by s{n)y t{n)Y^~^^~^ (3Q{n)(32{ny''^'^'^'^^ = o{n''). 
In total, 






U 



We know from Lemma |5.1| that crossing partitions are asymptotically negligible. 
We will apply Lemma 5^ to see that this is also the case for certain non-crossing 
partitions. 

From now on we will always assume that (32{n) = 0{n^) Ve > and f3o{n) f32{n)'^ = 
0(77^) V7/ > 0. 

Corollary 5.3. Let tt G 'P'P\2ki]u\2k2] ^^ ^ non-crossing partition with exactly two 
connectors on each circle. Then the connectors form aA-block, orvr is asymptotically 
negligible. 



12 OLGA FRIESEN, MATTHIAS LOWE, AND MICHAEL STOLZ 



Proof. Assume that vr has no 4-block. By iteratively applying Lemma 5.2 and suit- 
ably relabeling we obtain a partition n' G 'P'P\2k' \u\2k' ] "without nearest neighbor 
pairs, and we have a bound 

^# M„(7r) < c47# M„(7r') + o(l), 

where k = ki + k2 and k' = k[ + k^- It is clear that vr' must be non-crossing, and 
since it has no nearest neighbor pairs, all points must be connectors. So k'^^ = k'2 = 1. 
The 1-circle is of the form {pu^qu), (^12,^12), and by (B2) one must have pn = 
P12, 5'ii = Q'12- This contradicts the assumption that vr, hence vr', has no 4-block, 
which implies that (pii,gii) t^tt' {pi2-,qi2)- So #M„(7r') =0. D 



On the other hand it is easy to see that if the connectors of vr form a 4-block, then 
vr will give a nonzero contribution to the limit. In view of these results we define 
A/'PT^I^ iLjf2fc2l ^^ ^^^ ^^^ °^ ^^^ non-crossing partitions in 'P'P\2k^]vj\2k2\ "^i^h a 4-block, 
and for m > 2 we define A/'P'PS^ iLjf2A:2l ^^ ^^^ ^^* °^ ^^^ non-crossing partitions in 
■^■^f2fciiuf2fc2l *ith exactly 2m connectors on each circle. 

To identify one further class of non-crossing partitions with asymptotically vanishing 
contribution, we follow [^ and introduce the notion of a dihedral partition, which 
means that neighboring connectors in the first circle are connected to neighboring 
connectors on the second circle. Here connectors {i,l),{i,l'), I ^ I', are neighboring 
if one of the intervals {z}x]/,Z'[j or {z}x]/',Z[j contains no connectors. Denote by 

PAAPp2mp[2fc2] ^ -^^^[2Ti]u[2fc2] ^^^ ^^^ ^^ ^^^ dihedral partitions of [2ki] U [2^2] 
with precisely 2r7i connectors on each circle. Note that the number of connectors 
on a circle differs from the length of a circle by an even number of points. For 
/ci = ^2 = m > 2, it will be useful later to characterize 1^J^'P'Pf2!rli]u[2m] ^^ terms of 
the dihedral group D^m- Recall that D^m is the subgroup of the symmetric group 
Sym(2m) that is generated by the 2m-cycle (1 2 3 . . . 2m) and the transposition 
(1 2m) (2 2m — 1) . . . (m m + 1). Then the elements of l^-^'P'Pf^lli^ijnm] ^^^ precisely 
the partitions of [2m] U [2m] of the form {{(1, r), (2, g{r))}, r = 1, . . . , 2m} for some 

9 e Birn- 

Lemma 5.4. If n E J^'P'Pf2k^]ij\2k2] '^^ "'^^ dihedral, then it is asymptotically negligi- 
ble. 

Proof. We eliminate all nearest neighbor pairs from vr and obtain vr' G ■^'P'Pf2m\u\2m\^ 
which consists only of connectors. Observe that vr' is not dihedral. In view of 



Lemma 5.2, it suffices to prove that vr' is asymptotically negligible. Observe fur- 
thermore that m > 2, because otherwise, all points on a circle are neighbors, and 
all partitions are thus dihedral. In particular, it follows from (PP2) that vr' con- 
tains no 4-block. Assume without loss that (1, 1) ~,r' (2, 1) and (1, 2) ~7r' (2, /) 
with / ^ {2, 2m}. We start the usual bounding exercise at (1, 3), for which we have 
s{n)t{n) choices. Since all points on the 1-circle are simple connectors (as no 4-block 
exists), (1, 4), ... , (1, 2m) afford (s(n) V t(n))^™~^ choices. Choosing index pairs for 
all points on the 2-circle except (2, 1) and (2, /), we get a bound of /32(n)^™'~^, since 
no new block is reached in the process. By (B2), then, P21 and P21 are already 
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determined, and for each of Pii,Pi2 there are at most f32{n) choices. Summing up, 
we obtain that 

# M„(7r') < {s{n) V t{n)f"'-^P2{nf"" = 0{n^"'-^+') W > 0. 

Since k — 2m pairs have been removed. Lemma |5.2| imphes what was claimed. D 

Summing up, we have reduced the calculation of covariances to the expression 

C2(tr(W'^i),tr(W'=2)) 

-. fciAfc2 

= ^kll E E C2iaiP,),aiP2)) + oil), (5.4) 

where for i = 1 , 2 we have written 

a{P;) := J] a{P,^2i-MP^)- (5-5) 

1=1 

6. The covariances: reduction of multi-indices 

We now revisit the process, described in the previous section, by which we assigned 
to TT G ^■A/'PP^2'fci]u[2A;2] ^ partition vr G ^■A/'PPp^jupm] by successively eliminating 
all 2-blocks of vr on the same circle. One step of the process consists in removing a 
block (i, /) ~7r (^, ^ + 1) and then relabeling the remaining pairs 

{i,r), j = iandr<l, (6.1) 

(i, r — 2), j = i and r > / + 2, 

if / 7^ 2/cj and 

(^,2A;,-2), J =2 and r = 2, (6.2) 

(i, r — 2), j = « and r 7^ 2, 

if / = 2A;j. The partition vr is then obtained by repeating this step until only con- 
nectors are left. We are now going to study the multi-indices that are consistent 
with 71 and if. To this end, we introduce a subset PM„(7r) C M„(7r) such that any 
P = (Pi^i) G PM„(7r) has the additional property 

(P) (z,/)~^(2,/') => Pi,i = P^,V. 

Note that for tt G 'PA/'PPp^]LJ[2m] ^s above one has that PM„(7r) = M„(7r). Now, 
for P G PM„(7r) we define P G PM„(7r) = Mn(7r) to be the multi-index that remains 
after eliminating in each step an equivalent nearest neighbor pair and then relabel- 



ing by the same procedure as in (6.1) and (6.2). To see that P is indeed consistent 
with TT, consider the first nearest neighbor pair (i,/) ~7r {hi + 1) which has to be 
removed. By (P), we have that Pi^i = Pi^i+i- Since P satisfies (B2), we obtain that 
either qi_i = qi+2 or pi_i = pi+2 depending on whether / is even or odd. This im- 
plies that condition (B2) is still satisfied after {{i, I), {i, / + 1)} has been eliminated. 
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Furthermore, the elements of the multi-index P' which remains after this first step 
are relabeled in the same way as those of the partition it', that is P' is consistent 
with 7c'. The same argument can be applied for any following nearest neighbor pair. 
Consequently, P is vr-consistent. 

The following lemma ensures that the complement of PM„(7r) does not contribute 
to the limit. 

Lemma 6.1. Let vr G ^A/'P^pXiupfea] ■ Then 

^#(M„(7r)\PM„(7r))=o(l). 

Proof. If ki = k2 = m, then M„(7r)\PM„(7r) = 0, and there is nothing to prove. 
Thus, let ki > m or k2 > m. Since vr G 'PA/'PPpfeilu[2A:2] ^^ non-crossing, it contains 
at least one nearest neighbor pair. Without loss of generality, we assume that 
{(1,2/ci — l),(l,2A;i)} is such a 2-block. Let tt' denote the partition obtained by 
eliminating this block. First of all, we want to verify that 

^# (M„(7r)\ PM„(7r)) < C7-^# (M„(7r')\ PM„(7r')) + o(l). (6.3) 



To this end, we simply need to mimic the proof of Lemma 5.2 Thus, take some 
P G M„(7r)\PM„(7r). If Pi,2fci-i = Pi,2ki, then the reduced multi-index P' is con- 
sistent with vr', and (P) is still not satisfied by P'. In other words, we have P' G 
M„(7r')\PM„(7r'). This allows us to conclude that any element P with Pi,2fci-i = 
Pi,2ki can be constructed from an element of M„(7r')\PM„(7r') and a choice for 
Q'i,2fci-i = (li,2ki- The latter admits t{n) possibilities. However, if Pi,2A:i-i t^ Pi,2ki, 
we can conclude that pi,2fci-i 7^ Pi,2ki since gi,2fci-i = (li,2ki by (B2). In particular. 



Pi.2ki-2 7^ Pi.i- This situation has already been analyzed in the proof of Lemma 5.2 
and led to the upper bound o{n'') for the number of elements in M„(7r) such that 



Pi,2ki-2 7^ Pi,i- To sum up, (6.3) holds. 



Applying this estimate successively, we arrive at 

^# (M„(7r)\ PM„(7r)) < c4^# (M„(7r)\ PM„(7r)) + o(l), 
n n 

where tc G ^A/'PP^a'miupm] i^ the reduced partition described at the beginning of 
this section. However, PM„(7r) = M„(^), implying that # (M„(7r)\PM„(7r)) = 0. 
This completes the proof. D 

Hence we may replace M„(7r) by PM„(7r) in (5.4) to obtain 

C2(tr(W*=i),tr(W'=2)) 



, kiAk2 

tE E E C2(a(Pi),a(P2)) + o(l), (6.4) 



m=i -e©ArpP[|™ ,^p,^, P6PM„(-) 



Now, note that {i, I) ~vr {i, I') in particular implies that either / is odd and /' is 
even or vice versa. This is due to the fact that we have a non-crossing partition on 
a set of even cardinality. Consequently, exactly one of the elements a{Pi^i) , a{Pi^if) 
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appears as its complex conjugate in the covariance C2(a(-Pi), a(P2))- Moreover, by 
property (P), Pjj = Pii'. Thus, each equivalent pair on the same circle contributes 
a factor E(|a(Pj^;)p) = cr^ to the covariance. So we obtain 

C2(a(Pi), a{P2)) = a2'^i+2;c,-4n^ C2(a(A), ^(A)), (6.5) 



where we have used the shorthand (5.5). This relation indicates that it will be 



sufficient to consider the set M„(7r) of all reduced multi-indices instead of the set 
PM„(7r). To make this statement more precise, we want to proceed by counting 
the number of multi-indices P G PM„(7r) that lead to the same reduced multi- 
index P G M„(7r). Therefore, take a partition vr G T^J^'PVf^^-^yj^2k2] ^^^ ^^° indices 
{i, I) ~7r {i, I') which are not connectors. Define the initial point of {{i, I), {i, /')} as 

,,. ,, ,. ,,., f /, if 1 /,/'[,• does not contain connectors, 
IM', I), {', I )) := |,,_ jf ,,,_ ,|^ ,^^^ ^^^ ^^^^^^ ^^^^^^^^^^ 

Note that 7^((z,/), («,/')) is well-defined since vr G T^J^'PVf^^p^2k2] ^^ non-crossing, 
implying that there is no connector in ]/,/'[j if and only if there is at least one in 
]/',/[»• III particular, we say that a pair {(i, /), (i, /')} of equivalent points on the 
same circle is even if 7jr((«,/), {ij')) is even. Otherwise, we call the pair odd. Now 
we put 

even(7r) := #{{(«, /), (^, /')} g vr : z G {1, 2}, /, /' G [2A;,], {(2, /), (^, I')} is even}. 

If we take some P = {{pi,i,qi,i))i=i,2, i=i,...,2k, e PM„(7r), then by (B2) even(7r) can be 
characterized as the number of pairs (p,^/, qi^i) ~„ {Pi,i', qi,i') with •jniih 0) (^5 ^')) = ^) 
such that the element Pi^i = Pi^i' is not determined by the pairs (j)ij,qij) with 
j G ]l',l[i- However, in this case qi^i = qiji is uniquely determined by those pairs 
since qi^i = qi^i-i if / is even. 

Lemma 6.2. Let vr G VMVV\^^-^^^2k2] «^^ Q ^ M„(7r). Put 

PM„(7r; g) = {P G PM„(7r) : P = q} . 



VFe /iaf e 

1 



ki+k2 — 2m 



n 



# PM„(7r; g) = Keven(7r)^/ci+fc2-im-even(^) ^ ^(^^j^ (g g^ 



Proof. For fixed ?ti, we prove the statement by induction over 2k = 2ki + 2^2- Since 
vr G ^A/'PPp^j]u[2fc2ii the smallest value 2k can take is 2k = Am. In this case, we 



can conclude that vr = vr, implying #PM„(vr;g) = 1 and even(7r) = 0. Thus (6.6) 
holds without the term oil). 



Now, suppose that (6.6) is true for 2k = Am + 2{j — 1) with j > 1, and consider a 
vr G T^-^'P'P'[2ki]u[2k2] with 2k = 2ki + 2A;2 = 4m + 2j. Since vr is non-crossing, there 
is an index (i, /) such that (i, /) ~7r (^? / + 1). Without loss of generahty, we assume 
that i = 1. Consider the partition vr' G ^A/'PP?2'fei-2iu[2fc2] obtained by eliminating 
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the block {(1,0; (1)^ + 1)}) ^^'^ relabeling as in (6.1) or (6.2). Then the inductive 
hypothesis guarantees that 

^^,^_,l,^_,^ #PM„(7r';g) = ^even(.')^{/=l-l)+/c.-2™-even(.') +^(1). 

Now, it is possible to extend any multi-index P' G PM„,(7r';Q) to a multi-index 
P e PM„(7r;Q) by specifying Pi^i = {pi,i,qi,i) and Pi,z+i = (pi,;+i, gi,i+i). Since 
]l, / + 1[]^ = contains no connectors, we conclude that 77r((l, /), (1, ^ + 1)) = i- First 
suppose that / is even. In this case, the consistency condition (B2) yields that the 
elements qi^i = qi^i+i are already determined by P', and we only have to choose 
Pi,i = Pi,i+i- There are at most s{n) possibilities to do so. This leads to the upper 
bound 

#PM„(7r;g)<s(r2)#PM„(7r';Q). 

To find a lower bound, note that the fact that {{i, I), {i, / + 1)} is a block of vr implies 
that {pi^i, qi^i) = (pj^/+i, gjj+i) is not in any ~„-block of any index pair from P'. (This 
is a requirement that could be safely neglected in the previous bounding exercises 
that aimed at upper bounds.) Since vr' has at most 2m + j — 1 blocks, we obtain 
the estimate 

# PM„(7r; Q) > (s(n) - (2m + j - 1) P2{n)) # PM„(7r'; Q) 

= (sin) - O {n')) #PM„(7r';Q). 

Since by assumption, ^^ — )■ k as n — )■ oo, we can combine these bounds to obtain 

Note that any equivalent pair (j, r) ~^ (j, r'), (j, r) ^ {(1, /), (1, / + !)}, is even with 
respect to it if and only if its relabeled version is even with respect to tt'. Since 
further {(1, /), (1, / + 1)} was chosen to be even, we obtain even(7r) = even (tt') + 1. 
This concludes the proof for the even case. 

Now suppose that / is odd. This implies that the element pi^i = pi^i+i is already 
determined by P' and we need to choose qi^i = qi^i+i- This time, there are at most 
t{n) possibilities, and -^ — )■ /i. Proceeding as in the even case, we see that 

^,j,^_,^ #PM„(7r;Q) = ^evcn(.')^fci+fc2-2m-even(.') + ^(1). 

Since ■j.,,{{l,l), {1,1 + 1)) = / is odd, the identity even(7r) = even (vr') holds, which 
proves the second case. 

D 



Now, using (6.5), Lemma 6.2, and the shorthand (5.5), equation (6.4) becomes 



C2(tr(W'=i),tr(W''2)) 



kiAk2 ^2k~Am 



Y.—^ E «:--W/-2"nW ^ C2(a(Pi),a(P2)) + o(l). 



n 

Recall that for m > 2 the dihedral group D^m can be identified with VMW 

(see the paragraph preceding Lemma 5.4). In particular, for any partition vr G 



2m 



[2m]U[2m] 



GAUSSIAN FLUCTUATIONS 



17 



I?A/'P'Pp™^]u[2fc2]5 there is some g E D^rn such that n = Hg, where TCg is the partition 
with blocks {{(l,r), {2,g{r))} : r = 1, . . . ,2m}. Now take g G -D4m and define for 
any < j < k — 2m 



4T;:2k, := # {vr G vAfvrX 



u[2k2] : TT = TTg, even^TTj 



j}- 



2m, j 



It will emerge from (6.9) below that ^42^^ 2A:2 ^^ independent oi g E I^4m- If m = 1, 
we have to introduce a slightly different notation. This is due to the fact that 
by definition, the set T^J^Wf2]u[2] contains exactly one element given by the 4- 
block {{(1, 1), (1, 2), (2, 1), (2, 2)}}. In particular, any partition tt G VArvrf2k,]u[2k,] 
induces the same reduced partition. With this in mind, we define D4 := {id{i2}}, 
7rid,^,j = {{(l,l),(l,2),(2,l),(2,2)}}, and 

^2fci,2fc2 ■ = i^{Tre I?A/':P7^[2fei]u[2fe2] : TT = TTg, even(7r) = j} 
= # {tt G I?ArpPf2fe,]u[2fc2] • even(7r) = j} . 
So we have 

C2(tr(W'=i),tr(W'=^)) 

kiAk2 2k— Am '^— 2m 

= E ^W E ->'"""^' ^S'k E E c2(a(Po, a(i^2)) + o(i) 



m,=l 

fciAfc2 



n 






k—2m / \ 1 



m=l 



j=0 



/^ 



3eD4m PeM„(7rg) 

+ 0(1), 



4 2m ,i 
2fci,2fc2 



(6.7) 



where 



/i 



k—2m 



Un{2m) ^^^ 



J2 E CMPi),a{P2)). 



To determine C2(a(-Pi), a(P2)), observe that for any g G D^m-, ""^ > 2, vr^ has no 
equivalent elements on the same circle, so we can conclude that E(a(Pj)) = 0. 
Further, all blocks of vr^ are of the form {/,(?(/)}. Hence, for P G M„(7rc,), 

2m 



C2(a(Pi),a(P2)) = \{n<Pi,iy^'^<P2,. 



9(0 



^e(9(r))^ 



1=1 



where a{Pi 



ii 



\eil) 



a{Pi^i) if / is odd and a{Pi 



i,i 



\e{l) 



a{Pi^i) if / is even. On the other 
hand, ii m = 1, condition (B2) yields Pi^i = Pi, 2 and P2,i = P2,2- Thus, we arrive at 

C2(a(Pi),a(P2)) = C2(|a(Pi,i)|Ma(P2,i)|2). 

We then have 



/i 



fe-2 



n^ 



E 



C2{\a{p,q)\ ,\a{p',q')\ ), m 



Un{2m) = < 



fi 



k—2m 



(p,9),(p',g')e[s(n)]x [<(«)], 
(p.|?)~n(p',g') 

2m 



(6.8) 



n 



2m 



E E \{na{Pi,i)'^'^a{P,,^ 



9(0 



1^(9(0) 



), m>2. 



geZ)4m PeM„(7rg) l=i 
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To evaluate the sum in (6.7), which involves the values A^'-'^f, , we decompose a 
partition vr G VAfVV 



2m 
[2fci]U[2fc2 



into two partitions tti and 712 by cutting all links 



between connectors (cf. [7], [H]). To be precise, for z = 1,2 we define 



I ~^- /' :<^=^ {i,l) ~^ {i,l'), (ij), {ij') are not connectors. 

The partitions vr,, z = 1,2, are called non- crossing half pair partitions. In general, a 
non-crossing half pair partition of [k] consists of 2-blocks and 1-blocks, called open 
connectors. Further, if / and /' form a 2-block, then any two points q G ]l,l'[ and 
q' G ]l', l[ are not in the same block and at most one of them is an open connector. 
In analogy to j9], we denote the set of non-crossing half pair partitions of [k] with 
m open connectors by Nl-LWVu . Now take -n G AfTiVV]^! . In accordance with the 

definition of the map even(-) on ^A/'PPpfei]uf2fc2l' *^ P^^ ^'^^ ^^^ ^ ^■^ ^' 



lAiJ') ■■-- 



I, if ]l,l'[ does not contain open connectors, 
/', if ]/', /[ does not contain open connectors. 



and 



even(7r) := #{{/,^'} G n : 7^(/,r) is even}. 



Furthermore, for any < j < /c — ?7i, let 



unvv 



2m,j 



{7rGA/'-HP«i:even(7r)=j}. 



Note that if tt G 1^J^'P'Pf2ki]u[2k2] ^^ decomposed into vri and 7r2 as described above, 
we can reconstruct tt uniquely if we know the structure of the connectors, that is vr. 
Hence we have a bijection 



{tt G I^ArPPf2l]u[2fc2] -^ = ^9, even(7r) = j} 

mm{j,ki—m} 

- u 

i=inax{Oj — (fe2— ni)} 



Arnvvl^,^ X Arnvvl^,i\ 



implying 



4 2m J 
^2fci,2fc2 



min{j,fei— m} 



Y: 4^Mnpv^ 



i=max{0,jr'— (A;2— JTi)} 



4^MHrv\zr- 



(6.9) 
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In particular, ^2Au'2A:2 ^'^^^ ^'^^ depend on g G D^m- Now we have 

fci+fc2 — 2m 

I K \' 

I 1 /, 

2fci,2fc2 



g l^j ^^"^' 



j=0 j=max{0,j-(fc2-m)} ^^^ ^^^ 

:i-m fc2 — m+i / \ * / \ j-i 

E E u #-^^^^S ■ ^ #A^^^^S"' 

:i— m ^ X i k2—m / ^ j 

E [I) ^^-H-p-Pmi ■ E [I) #-^7/pp-;f 



Defining 

k—m / \ j 

Gk,m ■.= Y1 ( - ) #Ar-H^Pfm,,, fc G N, m = 1, . . . , A;, (6.10) 

i=o ^^^ 
we obtain 

C2(tr(W'=^),tr(W^^)) = J2 a2'=^+2fe-4m ^^^^^ ^^^_^ ^^^2m) + o(l). (6.11) 

m=l 
7. COVARIANCES AND ChEBYSHEV POLYNOMIALS 

The aim of this section is to apply the calculations we made so far to compute the 
covariance for shifted and re-scaled Chebyshev polynomials. This will complete the 
proof of Theorem 2^ To this end, we start with the monic Chebyshev polynomials 
{Tk, fc > 1} of the first kind on the interval [—2,2], defined by the trigonometric 
identity 

rfe(2cos(^)) = 2cos(A;^). 

Put T_i{x) := and To{x) := 1. Then the polynomials satisfy the recurrence 
relation 

xTk{x) = Tk+i{x) + (1 + 4,i)Tfc_i(x), fc > 0, (7.1) 

and are orthogonal for the dilated arc-sine law 7=^- A slight modification of 
the (Tfc) yields a family of orthogonal polynomials that has been used by Cabanal- 
Duvillard in [2] to diagonalize the fluctuations of Wishart matrices. Fix y G (0, 00) 
(which will eventually be chosen as k//^), set a := {^/y — 1)^, b := {y/y + 1)^, and 
define for a < x < b: 

T,ix) ■.= Tlix) := v^T, (^ "^"^^^^ ^ , fc > 0. 
Then the polynomials {r^, k > 0} are orthogonal for the shifted arc-sine law 



dx 



on (a, b) and satisfy the recurrence relation 



y/{b-x){x-a) 

xTk{x) = Tk+i{x) + {l + y)Tk{x) + {l + 6,,i)yTk-i{x), k>0, (7.2) 
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where r_i := for convenience. Define the re-scaled versions 



2k 



rk{x,a) = a'T 



X 



a' 



2 / ' 



(T> 0, 



and write 



r,(a:,a) = 5^a^^-^™^^,„,x 



m=0 



where g'j^j^ = l. Let V denote the lower triangular matrix with entries (7^^, that is 



r := 



/I •■■\ 
9'ifl 1 ■■ 

92,0 92,1 1 ■■ 



P-i 



(7.3) 



The inverse of this infinite dimensional matrix can be found by inverting the finite 
principal minors. We then see that T~^ is also a lower triangular matrix. Thus, we 
put 

/I •••\ 
91,0 1 ■■ 

92,0 92,1 1 ■ ■ 

V : ; ; / 

We set gk,k '■= 1 for any /c G N, and gk,m '■= in case m > k. It will be proven in 
the appendix that if y is chosen as n/n, then for any fc G N, m = 1, . . . ,k, one has 
9k,m = Gk,m, wherc the latter was defined in (6.10) above. Combining this result 
with (6.11), we obtain 

C2(tr(r„(W,a)),tr(r,(W,a))) 

m I 

= Y^Y.^"^^"''""''"'' 3'm,k. 9[,k, C,(tr(W'=^),tr(W^^)) 

fci=0fc2=0 
m I kiAk2 

= E E E ^'"^''''' 9k.'P 9k,'P 9L,k. 9\,k. Unm + 0(1) 

fci=0fc2=0 p=l 
oo m I 

= Y^'-^"'~"' Un{2p) J2 E 9L,k. 9kuP 9'i,k, 9k,,, + 0(1). 

p=l ki=0k2=0 

Since ^^^g g'^f, gk,p = Sm,p for any p=l,...,m, this implies 

mAl 

C2(tr(r^(W,a)),tr(rKW,(T))) = ^ a^^+^'-^P S^^p Si,p [/„(2p) + o(l) 

p=i 
= 6^,1 Uni2m) + o{l). 



This is exactly the second part of the statement of Theorem 2.3 
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8. Shortcuts in the proof using the Wigner case 

Many steps in the above proof have run in parallel to the corresponding steps in the 
treatment of the Wigner case that was provided by Schenker and Schulz-Baldes in 
[9] . We have chosen to explain this proof in full detail in order to make our paper 
accessible without assuming familiarity with [9]. Nevertheless it should be noted 
that by representing sample covariance matrices as "chiral" hermitian matrices, i.e. 
as elements of the tangent space to a symmetric space of type AIII (see [4j), one 
may avoid a few pedestrian arguments by citing the corresponding lemmata in [9]. 
This will be explained in the present section, where we will freely use notations and 
results from [5]. 

In order to reduce the case of sample covariance matrices to that of Wigner matrices, 
we define 

Hn := ( ^ bniP, q)) := I ^* p" 

VV"- / p,q=l,...,s(n)+t{n) V ™ 

Hn is a Hermitian {s{n) + t{n)) x {s{n) + t{n)) matrix with E {bn{p, q)) = for any 



{p,q) G [s{n) +t{n)f and, by (1.1) 



sup max E,{\bn{p,q)\ ) = m^ < oo 



neN (p,9)G[s(n)+t(n)]2 



for any /c G N. The idea to consider H„ arises from the relation 

tr (H^ = 2 tr (W^,) , (8.1) 

which allows us to calculate traces of W„ if those of H„ are known. In order 
to apply the results of [Ij, we need to introduce an equivalence relation ~* on 
[s{n) + t^n)]"^ which appropriately describes the correlations between the entries 
{bnip, q),l < p,q < s{n) + t{n)}. Hence, start with the set 

Dn := [s{n)]^ U [s{n) + 1, s{n) +t{n)]^. (8.2) 

If {p, q) G -D„, then 6„(p, g) = 0, and we take {(p, g), {q,p)} to be an equivalence class 
with respect to ~* . Considering, however, the set [s{n) + t(n)]^\D„, we introduce 
a map 

^ : [s{n)+t{n)\\Dn ^ [s{n)\ x [t(n)], 
with 

,// ^^^nP^q-s{n)), iiqe[s{n) + l,s{n)+t{n)], 
\{q-,P- s{n))^ if p G [s(n) + l,s(n) + t(n)]. 



Note that 6„(p,g) = an{^lj{p,q)) if g G [s{n) + l, s{n)+t{n)] and6„(p, g) = an{i>{p,q)) 
if p G [s{n) + 1, s{n) + t{n)]. We thus define (p, g) ~* (p', g') G [s{n) + t{n)]^\Dn if 
and only if ijj{{p, g)) ~„ 'ip{{p', q'))- In accordance with the notation in [5], put 

ao(n) :=#{(p,P,g) G [s{n)+t{n)f : {p,q) ~; {q,p') L p ^ p} , 
a2{n) := max #{(p',g') G [s{n)+t{n)]^ : (p, g) ~; {p',q')} , 

{p,(ir)e[s(n)+i(n)]2 
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and note that ao{n) < 2/3o(^) and 0:2 (^) = 2/^2 (?^). 



To prove the first statement of Theorem |2.3[ we use the multilinearity of cumulants 
and relation (8.1[) to obtain 



C,(tr(W^),...,tr(W^O) = ^C,(tr(H^),...,tr(H^)). (8.3) 

By assumption, a2{n) = 2(32{n) = C(n^) for any e > 0. Hence, H„ satisfies the 



conditions of Theorem 2.1 in |9j, implying that the right hand side of (8.3) is o{l] 
ifj>3. 



To verify the second part of Theorem 2^ , it is not possible to apply the corresponding 
results in p] in a straightforward manner. The difficulty is that the blocks on the 
diagonal of H„ are zero. However, Theorem 2.4 in [11 requires the same variance 
for all entries. Nevertheless, we can use at least parts of the proof to see that the 
covariance can be calculated for any /ci , A;2 G N as 



C2(tr(H::^),tr(H:;^)) 



iE E E C2 n&„(Pi,o,n&n(P2,o +0(1), (8.4) 



n 2 ^=1 ^e2,^ppmj^j^^jPgPS^j^j^^(,^j(^) 

where PS^(^)+t(„)(7r) is the set of all P = {Pi,i)i=i,2,i=i,...,ki = i{Pi,hqi,i))i=i,2,i=i,...M 
satisfying 

(CI) P,,ie[s{n)+t{n)f, 

(C2) qn = Pi i+i, where fcj + 1 is identified with 1, 

(C3) {i,l)r^U^',n^^P^,l<P^',l', 

(C4) if {i, I) ~^ {i, I'), we have P^ = {p, q) and Pi^v = {q,p) with p ^ q. 

To circumvent the problem of non-identical variances, the idea is to simply eliminate 
those entries which are equal to zero. Thus, as at the beginning of this section, we 
put Dn := [s(n)]^ U [s{n) + 1, s{n) +t(n)]^. In particular, bn{p,q) = if and only 
if {p,q) G Dn- Denote by PS*(„)_,_j(„)(7r) the set of all P G PSs(„)+i(„)(7r) such that 

(*) {Pi,h qi,i) e [s{n) + t{n)]'^\Dn for alH = 1, 2; / = 1, . . . , ki. 
Now P G PS,(n)+t(n)(7r)\PS*(„)+i(„)(7r) implies 

C2 n^"(PM),n^"(P2,/) =0. 



We can thus consider the set 'PS*s(n)+t{n){'^) instead of PSs{n)+t{n){'^) in equation (8.4). 
Note that relation (|8.1) yields 



C2(tr(W^^),tr(W^^)) = 1 C2(tr(H^),tr(H^^^)). 
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Furthermore there are no dihedral partitions of [2ki] U [2k2] with an odd number of 
connectors, that is ^■A/'P'Ppj^. lypfc^i = whenever m is odd. Hence, we find that 

, fciAfcz /2fci 2fc2 \ 

^^^j^j: e e cJn^«(^^^0'n^«(^^^o +-w• 
(8.5) 

To recover the results from the previous sections, we need to describe the sets 
PS*(„)_i_j(„) (tt) in terms of PM„(7r). To this end, define 

and 

PM„ := U PM„(r7). 

In order to compare PM„ with PS*(-„-)_,_4(-„-), introduce a map 

$ = (v^i, v^2) : PS:(„)+,(„) ^ PM„, P ^ $(P) = iMPi),MP2)), 

where for any P = (Pi,/)i=i,2, i=i,...,2ki, we put P, := (Pi,i)i=i,...,2fc, {i = 1,2). The aim 
is to define $ in such a way that the covariances are invariant under this mapping, 
that is for any Q = $(P), we wish to have 

(2fci 2fc2 \ 

llUPi,i),llUP2,i)] =C2(a„(Qi),a„(Q2)), (8.6) 

1=1 1=1 J 



where an{Qi) = YliLi 0'n{Qi,2i~i)0'n{Qi,2i) ■ This can be achieved as follows: 

Fix n e ^AAPPgi]u[2fe2] and P = {{pi,i,qi,i))i=i,2, i=i,...,2k, e PS*(„)+i(„)(7r). By def- 
inition of PS*(„)+4(„)(7r), P = {{pi,i,Pi,i+i))i=i,2, 1=1,.. .,2k,, where 2/^^ + 1 is identified 
with 1. Furthermore, (*) guarantees that for any ? = 1, 2, either 

(I) Pi,2i-i e [s{n)] and pi^2i e [s(n) + 1, s{n) + t(n)] for any /, 
or 

(11) pi^2i e [s{n)] and pj,2Z-i e [s(n) + 1, s(n) + t(n)] for any /. 

If case (I) holds, we define 

^i{Pi)2i-i = {Pi,2i-i,Pi,2i - s{n)), ipi{Pi)2i = {Pi,2i+i,Pi,2i - s{n)). 
On the other hand, if (II) holds, we put 

fi{Pi)2i+i = {Pi,2i,Pi,2i+i - s{n)), (fi{Pi)2i = {pi,2i,Pi,2i-i - s{n)). 

Note that in the latter case, (fi shifts all pairs by 1 to the right. Otherwise, condition 
(B2) would not hold. Hence $(P) = {ipi{Pi), ^2{P2)) ^ PM„(r7) for some partition 77 
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which might be different from vr. However, $ sends adjacent connectors to adjacent 
connectors. Thus rj G ^A/'P'PSfciiupte]' implying that $ indeed maps to PM„. In 
particular, note that for any element Q G PM„, we have 

#{P G PS:(„)^,(„) : <I>(P) = g} = 4. 

Now put Q = <^{P). Then 

]^&n(-Pi,i) = ]^an(Qi,2Z-l) ■ a'n{Qi,2l) = dniQi), 
1=1 1=1 



regardless of whether (I) or (II) holds. Consequently, we have the identity in (8.6) 
implying that the covariance depends only on the image of P under $. To sum up 
our results, we obtain 

(2ki 2fc2 

ll^n{Pl,l),l[^n{P2,l 

= ^ Yl Yl C2(a„(Qi),a„(Q2)). 

^eCATPPf^^j^p.^j QGPM4-) 



Thus, equation (8.5) becomes 

fciAfc2 



C2(tr(W^^),tr(W^^)) = -^X; E E C2(a„(P0,a„(P2)) + o(l) 



"^=1 -e^ATPPfm^i^p,^, PGPM„(^) 



This is exactly equation (6.4) above. Starting from that, we may now complete the 



proof of Theorem 2^ as above in Sections |6] and [7j 



Appendix A. Half pair partitions and Chebyshev polynomials 



In this appendix we show that a quantity that was defined in (6.10[) in terms of 



non-crossing half pair partitions and a quantity that was defined in (7.3) in terms 
of Chebyshev polynomials are in fact equal. 

Proposition A.l. For any k > 1 and 1 < m < k, we have 



The ideas of the proof are similar to those presented in [7], Theorem 25 and 27. The 
first step is to provide a combinatorial description of the coefficients gk,m, k > 0, 
< m < k. Thus we take A;>0, 0<?7i</i; and < j < k — m, and denote by 
Dj,m,k the set of all dot structures of white and black dots on the set [2k] such that 

• there are j black dots on odd numbers and the remaining k — j odd numbers 
have white dots, 

• there are m + j white dots on even numbers and the remaining k — {m + j) 
even numbers have black dots. 
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Put Djm,k = 0iij<0oTJ>k — m. We now have the following key fact. 
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Proposition A. 2. For any k > 1 and < m < k, we have 

k—m k—m 

9k,m = ^y^ i^Dj^rn,k = ^ V^ 
j=0 j=0 



k 

m + j 



Proof. The second equality is obvious. To prove the first one, we will show that 
both sides satisfy the same recurrence relations. Those for the left hand side follow 



from the recurrence (7.2) of the Tk polynomials. In fact, using the matrices 

/O 
2 



S :-- 



/O 1 



\ ■•• 




1 







1 


■■•/ 



T:= 



1 







1 










\ 



V 



•7 



equation (7.2) can be written as 

TS = ST + {l + y)T + yTT, 

implying 

ST-^ = T~^S + (1 + y)T'^ + yV-^T. 
In terms of the entries gk,m, this yields for any k > 1, 

gk+i,m = gk,m-i + (1 + y)gk,m + ygk,m+i, m > 1, 
and 

gk+1,0 = (1 + y)gkfl + '^ygk,i- 

Further, we have gi^o = 1 + y and gi^i = 1. Now we need to show that (A.l) and 



(A.l) 
(A.2) 



(A. 2) hold for gk-. 



into four distinct subsets 



To this end, we divide Dj^rn,k+i ior k > 1 



^jmk+i '■= {v ^ Dj^rn,k+i '■ the dots 2k + 1 and 2A; + 2 are white}. 



D 



(2) 



j,m,k+l ' 

j,Tn,k+l ' 
)(4) 



{rj G Dj^m,k+i '■ the dot 2 A; + 1 is black, the dot 2/i; + 2 is white}, 

{rj G Dj^rn,k+i '■ the dot 2A; + 1 is white, the dot 2A; + 2 is black}, 

^Tm k+1 •= {^ ^ Djm,k+i '■ the dots 2k + 1 and 2A; + 2 are black}, 
and, for m > 1, we construct four bijections 

^^'^ ■■ D^Lmi ^ D,,^-^,k, 
^^'^ ■ Dflk^, ^ D,-i,^,., 



,(4) ._ Jji^) 



-^ D, 



^ ' ■ ^j,m,k+l ~^ J-^j-l,m+l,k- 

Each map (f^^\ i = 1, ... ,4, simply deletes the dots 2k + 2 and 2k + 1. In case 
i = 1, for example, we have that two white dots are eliminated, one dot on an even 
number and one dot on an odd number. Thus, there are 2k dots left, j black dots 
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on odd numbers and m + j — 1 white dots on even numbers. In particular, (^^^^ maps 
to Dj^m-i,k, and is obviously bijective. Similar considerations can be made for (f^'^\ 
if^^^ and ip^^\ We can thus conclude that 

implying 

gk+l,m = gk,m-l + Z/^fc,m + ^fc,m + ygk,m+l 

= 9k,m-l + (1 + y)9k,m + ygk,m+i- 

Now let m = 0. The maps 'f^'^\ v^^^-* and v?*-^^ can be defined as above. However, we 
want to change the definition of v?'-^-' slightly to obtain the map 

which first erases the dots 2k + 2 and 2/c + 1, so we have j black and k — j white dots 
left on odd numbers, and j — 1 white and /c — (j — 1) black dots on even numbers. 
Then, (p^^^ reverses the color of all dots and afterwards, shifts them by 1 in clockwise 
direction. Now, there are j — 1 black and k — {j — 1) white dots on odd numbers. 
On even numbers, we have J = 1 + (j — 1) white and k — j = k — {l + {j — 1)) black 

dots. Thus, we see that (^^^^ is a bijection from -0^0^+1 ^° -^i-i,i,'c- '^° ^^^^ ^P' ^^ 
have 

#-Dj,o,fc+i = #-Dj-i,i,fe + i^Dj^ifi^k + ihDjfi,k + #-Dj_i,i,fe- 

In particular, 

^fc+1,0 = ygk,i + ygkfl + gk^ + ygk,i 

= (^ + y)gk,o + 2ygk,i. 



Since the recurrence relations (A.l) and (A. 2) now hold for {gk,mik > 1,0 < m < k} 



it remains to check that giQ = gi^ = 1 + y and gi^i = gi^i = 1. But this can be 
simply calculated as 

^1,0 = y' #^0,0,1 + y' #^1,0,1 = i + y, gi,i = 2/° #^0,1,1 = i- 



This completes the proof of Proposition |A.2 



D 



In view of the definition of G^.m in (6.10), with the choice y = k/jj,, Proposition A.l 



is proven once the following lemma is established. 

Lemma A. 3. For any k>l,l<m<k and < j < k — m, there is a bijection 
between AfHW ^2k\ 0,''^'^ Dj^rn,k■ 
Proof. To any given vr G N'T-LWri^f , we assign a dot structure in the following way: 

1. If / ~,r !■' and '^-,^{1, 1') = I, then color /' black. 

2. Color the remaining dots white. 
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Figure 2. In this example, we take k=6, ni=2, j=2 and n = 
{{1,12}, {2,11}, {3}, {4}, {5,6}, {7}, {8,9}, {10}} e MnVV\^y We 
color the end point of any 2-block black, i.e. the points 1, 2, 6, 9. The 
last picture shows the resulting dot structure which is in -D2,2,6- 



Since we have exactly j even and k — {m+j) odd pairs, the construction above leads 
to a dot structure which is in Dj^rn,k- To obtain an inverse mapping, we start at 
any black dot and connect it to the first available white dot when moving counter- 
clockwise. Here, available means that every time we pass over a black dot we must 
skip over an additional white dot. Note that by this procedure, we only connect odd 
numbers with even, and even numbers with odd. Clearly, we obtain a partition vr 
in Ml-LWl'^Y Further, we have that if / ~7r I' with 77r(/, /') = /, then / was a white 
dot and /' a black one. Thus even(7r) is equal to the number of black dots on odd 



numbers, that is evenfTr) 



j. In particular, vr G Nl-iW ^2k] . 



U 
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